W+Jets at CDF: Evidence for Top Quarks 
Recently, an anomaly of W+jets events at large invariant masses has been reported by CDF. 
Many interpretations as physics beyond the Standard Model are being offered. We show how such 
an invariant mass peak can arise from a slight shift in the relative normalization of the top and 
WW backgrounds. 



In recent years, the Tevatron experiments have run a successful search program studying weak gauge bosons and 
top quarks as well as searching for a Higgs boson and for new physics. A specific search for $V$+jets production 
($V = W, Z$) [1-3] follows a long list of motivations [4]: we can test QCD effects such as the so-called staircase scaling 
of $n$-jet production [5], we can search for triple gauge boson couplings in $W^+W^-$ and $WZ$ production [TJ[B], we can 
search for technicolor signals [3 [5], or in the case of two bottom jets we can look for $WH$ associated production [5]. 
Some of these channels include a study of the invariant mass of the two leading jets recoiling against a leptonically 
decaying $W$ boson 

$$p\bar{p} \to (W \to \ell\nu) + 2\ \text{jets} + X \qquad (\ell = \mu, e) \tag{1}$$

In their published study of $WV$ production, based on an integrated luminosity of 3.9 fb$^{-1}$ and focused on the invariant 
mass regime $m_{jj} = 50 - 130$ GeV, the CDF collaboration started to observe a slight excess of events in the region 
$m_{jj} = 150 - 180$ GeV [1]. A D0 search based on the lower luminosity of 1.1 fb$^{-1}$ does not show any excess in this 
mass range [10]. 

More recently, the CDF collaboration has published a dedicated study of the same anomaly [2] with harder background 
rejection cuts and reports a $3.2\sigma$ anomaly in the $m_{jj}$ spectrum. The excess is compatible with a resonance 
around 150 GeV. Many papers have since been published explaining this observation, including technicolor [11], 
supersymmetric [12], lepto-phobic $Z'$ boson [13], color octet [14], and other interpretations [15]. In this paper we 
suggest an explanation of the excess based on a slight relative shift of the weight of different background contributions 
on the $WV$ pole and in the higher-mass region. While it is certainly possible to relieve the tension between the measurement 
and the background prediction, for example by a shift in $m_{jj}$ or through the heavy flavor content of the proton, to 
our knowledge ours is the only way to explain the observed kinematic feature within the Standard Model. 



A second peak from top decays 

One of the backgrounds to $W$+jets production is the production of top quarks. Unlike all other Standard Model 
channels, top quarks lead to a second peak in the $m_{jj}$ distribution, in addition to the $W$ mass peak. The angular 
correlation behind this second peak is between the bottom and the up-type quark $q_\uparrow$ from the $W$ decay. In the $W$ 
rest frame the distribution is given by 

$$P(\cos\theta) = \frac{3}{8}(1+\cos\theta)^2\,F_R + \frac{3}{8}(1-\cos\theta)^2\,F_L + \frac{3}{4}\sin^2\theta\,F_0 . \tag{2}$$

In the Standard Model the relative size of these contributions is $F_0 : F_L : F_R \approx 0.7 : 0.3 : 0$ [16]. The corresponding 
invariant mass distribution is 



$$P(m_{bq_\uparrow}) = f_R(r)\,F_R + f_L(r)\,F_L + f_0(r)\,F_0 \qquad \text{with} \quad r = \frac{m_{bq_\uparrow}}{m_{bq}^{\max}} , \tag{3}$$

$f_R(r) = 6r(1-r^2)^2$, $f_L(r) = 6r^5$, and $f_0(r) = 12r^3(1-r^2)$. Its upper endpoint is $m_{bq}^{\max} = \sqrt{m_t^2 - m_W^2} = 154.6$ GeV, 
neglecting the bottom mass. 
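
The shape functions in Eq. (3) can be checked numerically. The following is a minimal sketch, not the simulation used for the figures: it evaluates the parton-level template for given helicity fractions, confirms that it is normalized on $0 \le r \le 1$, and locates its maximum by a simple grid scan. The mass values `M_TOP` and `M_W` are assumptions chosen for illustration (they are not quoted in the text), and all function names are hypothetical.

```python
import math

# Assumed inputs in GeV; the text does not quote the masses it uses.
M_TOP = 174.3
M_W = 80.4


def endpoint(m_top=M_TOP, m_w=M_W):
    """Upper endpoint of the b-quark/W-decay-jet mass, neglecting the bottom mass."""
    return math.sqrt(m_top**2 - m_w**2)


def f_R(r):
    return 6.0 * r * (1.0 - r**2) ** 2


def f_L(r):
    return 6.0 * r**5


def f_0(r):
    return 12.0 * r**3 * (1.0 - r**2)


def template(r, F_0=0.7, F_L=0.3, F_R=0.0):
    """Eq. (3) as a function of r = m / m_max, for given helicity fractions."""
    return f_R(r) * F_R + f_L(r) * F_L + f_0(r) * F_0


def integrate(func, n=10000):
    """Midpoint rule on the interval [0, 1]."""
    h = 1.0 / n
    return sum(func((i + 0.5) * h) for i in range(n)) * h


def peak_mass(n=10000):
    """Mass in GeV at which the template is largest, found by a grid scan in r."""
    best_r = max((i / n for i in range(n + 1)), key=template)
    return best_r * endpoint()


if __name__ == "__main__":
    print("endpoint [GeV]:", endpoint())
    print("normalization :", integrate(template))
    print("peak [GeV]    :", peak_mass())
```

For the Standard Model fractions the maximum of this parton-level template sits at $r = \sqrt{25.2/33} \approx 0.87$, below the endpoint, which is the kind of high-mass structure the argument relies on.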

The theory prediction for the $m_{bq_\uparrow}$ distribution is shown in Fig. 1. Because of the left-handed $W$ interaction, $m_{bq_\uparrow}$ 
gets contributions from $f_L$ and $f_0$; $m_{bq_\downarrow}$ corresponds to exchanging $f_L$ and $f_R$. Experimentally, we cannot distinguish 
between $q_\uparrow$ and $q_\downarrow$, so instead we define the invariant mass $m_{bj_1}$ with the harder of the two $W$ decay jets. This 
distribution is harder. Without $b$ tagging the only observable distribution is $m_{j_1 j_2}$, using the hardest two 
jets from the top decay. It shows a double peak structure from the sum of the $W$ peak and the $m_{bj_1}$ distribution. In 
Fig. 1 we also show how a stricter jet veto not only reduces the number of events but also produces a harder second 
peak in $m_{jj}$. 

Figure 1: Left: theory predictions for the $m_{bq_\uparrow}$, $m_{bq_\downarrow}$ and $m_{bj_1}$ distributions at the parton level, simulated for top pair 
production. Right: $m_{j_1 j_2}$ distribution with $p_{T,j_3} < 20$ GeV (dashed), 30 GeV (solid) and 40 GeV (dotted). 

Loose cuts 

In this first part of our paper we look at the original $WV$ analysis with the less significant but nevertheless clearly 
visible excess, shown in the left panel of Fig. 2 [1, 17]. The basic acceptance and background rejection cuts require 
one lepton and at least two jets plus missing transverse energy with 

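$$
\begin{aligned}
E_{T,\ell} &> 20~\text{GeV} & \slashed{p}_T &> 25~\text{GeV} & M_{T,W} &> 30~\text{GeV} \\
E_{T,j} &> 20~\text{GeV} & |\eta_j| &< 2.4 & |\Delta\phi_{\slashed{p}_T, j}| &> 0.4 \\
p_{T,jj} &> 40~\text{GeV} & |\Delta\eta_{jj}| &< 2.5 .
\end{aligned} \tag{4}
$$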
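
To make the selection concrete, the cuts of Eq. (4) can be written as a single predicate on an event record. This is only an illustrative sketch: the `Event` container and all of its field names are hypothetical, the kinematic quantities are assumed to be precomputed, and the missing-energy requirement is read here as a lower bound of 25 GeV, which is an assumption about the intended direction of that cut.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Event:
    # Hypothetical event record; energies and momenta in GeV, angles in radians.
    lepton_et: float
    missing_pt: float
    mt_w: float
    jet_et: List[float]
    jet_eta: List[float]
    dphi_met_jet: float
    pt_jj: float
    deta_jj: float


def passes_loose_cuts(ev: Event) -> bool:
    """Apply the loose selection; at least two jets must pass the jet criteria."""
    good_jets = [
        et for et, eta in zip(ev.jet_et, ev.jet_eta)
        if et > 20.0 and abs(eta) < 2.4
    ]
    return (
        ev.lepton_et > 20.0
        and ev.missing_pt > 25.0          # assumed lower bound on missing energy
        and ev.mt_w > 30.0
        and len(good_jets) >= 2
        and abs(ev.dphi_met_jet) > 0.4
        and ev.pt_jj > 40.0
        and abs(ev.deta_jj) < 2.5
    )
```

Keeping the selection in one function makes it easy to vary a single threshold and see how the accepted sample changes, which matters later when the jet thresholds are tightened.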

The main background is $W$+jets production with a variable normalization which can be fixed from the shape of the 
$m_{jj}$ distribution. This background shows essentially no structure. The second background is QCD jet production 
faking a lepton and missing transverse energy. For a $W$ decaying to an electron this background is about four times 
the size of that in the muon decay signature [17]. Again, this background has no visible structure in $m_{jj}$. 

Of roughly similar size is the top background, consisting of top pairs and of single top production. As discussed 
above, this background has a distinct shape, namely two peaks including a Jacobian peak around 140 GeV. We see 
this shape in the right panel of Fig. 2. The peak arises if we combine the $b$ jet with one of the two light-flavor jets from 
the $W$ decay, which means it gets contributions from top pair production and from single top production with a $W$ 
boson. In the analysis, this background is normalized to the theory predictions $\sigma_{t\bar{t}} = 7.5$ pb and $\sigma_{\text{single } t} = 2.9$ pb [20]. 
The signal in this analysis is $WV$ production. It has a clear peak dominated by $W^+W^-$ production at $m_{jj} = 80$ GeV, 
smeared by the experimental resolution. Its extracted rate, corrected to the total cross section without any detector 
effects or branching ratios, is $13.5 \pm 4.4$ pb for electrons and $23.5 \pm 4.9$ pb for muons. In combination this gives 
$18.1 \pm 3.3\,\text{(stat)} \pm 2.5\,\text{(syst)}$ pb. This combined number is compatible with the theory prediction. 
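
As a rough cross-check of these numbers, one can form a naive inverse-variance average of the two channels and the pull between them. The sketch below assumes the two quoted uncertainties are uncorrelated and Gaussian, which is an assumption made here for illustration and not a statement about how CDF performed the combination; the function names are hypothetical.

```python
import math


def weighted_average(values, errors):
    """Inverse-variance weighted mean and its uncertainty (uncorrelated inputs assumed)."""
    weights = [1.0 / e**2 for e in errors]
    total = sum(weights)
    mean = sum(w * v for w, v in zip(weights, values)) / total
    return mean, 1.0 / math.sqrt(total)


def pull(v1, e1, v2, e2):
    """Difference between two measurements in units of their combined uncertainty."""
    return (v2 - v1) / math.sqrt(e1**2 + e2**2)


if __name__ == "__main__":
    # Electron and muon channel results in pb, as quoted in the text.
    mean, err = weighted_average([13.5, 23.5], [4.4, 4.9])
    print("naive combination [pb]:", mean, "+-", err)
    print("electron-muon pull    :", pull(13.5, 4.4, 23.5, 4.9))
```

Under these assumptions the naive average lands close to the quoted combined value, while the two channels differ by roughly one and a half standard deviations.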

However, the two significantly different results for the electron and the muon analyses, with their different background 
compositions mostly in the $Z$+jets and QCD jets channels, raise the question of how well we actually know the total 
composition of all backgrounds. For backgrounds which do not have a distinct $m_{jj}$ shape this question is not very 
relevant, but for the top background and the $WV$ signal it matters. In the right panel of Fig. 2 we first show the 
individual templates for the top background and for the $WV$ channel. Our simulation is based on Alpgen [18] + 
Pythia [19] at the particle level. To model the measured $m_{jj}$ distribution we apply a Gaussian smearing. Our 
template $m_{jj}$ distributions reproduce the CDF results [17]. We fix the normalization to the 4.3 fb$^{-1}$ of Ref. [17] to 
properly take into account detector effects and efficiencies. This means that whenever we discuss the normalization 
of different cross sections we refer to the total rate after efficiencies and detector effects. 
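
The Gaussian smearing step can be illustrated with a few lines of code. This is a sketch under stated assumptions, not the procedure actually used for the templates: the relative resolution of 10% and the choice of a width proportional to the mass are hypothetical, as are the function names.

```python
import random


def smear(masses, resolution=0.10, seed=1):
    """Smear each particle-level m_jj value with a Gaussian of width resolution * m_jj.

    The 10% default is an illustrative assumption, not a value from the text.
    """
    rng = random.Random(seed)
    return [rng.gauss(m, resolution * m) for m in masses]


def histogram(values, lo, hi, nbins):
    """Count values in nbins equal-width bins on [lo, hi)."""
    counts = [0] * nbins
    width = (hi - lo) / nbins
    for v in values:
        if lo <= v < hi:
            index = min(int((v - lo) / width), nbins - 1)
            counts[index] += 1
    return counts


if __name__ == "__main__":
    # A sharp W peak at 80 GeV becomes a broad bump after smearing.
    smeared = smear([80.0] * 10000)
    print(histogram(smeared, 0.0, 200.0, 25))
```

Any narrow particle-level feature, including the endpoint structure of the top template, is washed out in the same way, which is why the templates are compared only after smearing.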

Figure 2: Left: excess in the $m_{jj}$ distribution above the $WV$ peak, as reported by CDF [1]. The electron and muon decay 
channels are added. Right: $m_{jj}$ templates for the $WV$ and top samples individually. The normalization is chosen to match the 
CDF data. We also show the difference between the two samples for a 10% change of $\sigma_\text{top}$ and a corresponding shift in $\sigma_{WV}$, 
as described in the text. 

The difference between the two templates becomes relevant if we change the relative contributions of the top and 
$WV$ backgrounds. The difference clearly matches the slight observed excess. To quantify this effect we compute the 
change in event numbers associated with a shift of the integrated rate or efficiency. We independently consider the 
peak region and the high mass regime 

$$
\begin{aligned}
\Delta N_{[64,96]} &= 926\,\frac{\Delta\sigma_{WV}}{\sigma_{WV}} + 542\,\frac{\Delta\sigma_\text{top}}{\sigma_\text{top}} \\
\Delta N_{[120,170]} &= 88\,\frac{\Delta\sigma_{WV}}{\sigma_{WV}} + 915\,\frac{\Delta\sigma_\text{top}}{\sigma_\text{top}} .
\end{aligned} \tag{5}
$$

These event numbers correspond to the CDF analysis [17]. Requiring that the sensitive normalization of the $WV$ 
mass peak $m_{jj} = 64 - 96$ GeV be unchanged relates the two shifts as $\Delta\sigma_{WV}/\sigma_{WV} = -0.59\,\Delta\sigma_\text{top}/\sigma_\text{top}$, assuming 
efficiencies do not vary vastly between the two mass windows. Using this relation we find a net shift in the high mass 
region 

$$\Delta N_{[120,170]} = 863\,\frac{\Delta\sigma_\text{top}}{\sigma_\text{top}} . \tag{6}$$

Throughout this paper $\sigma$ really means the cross section after cuts and efficiencies, i.e. $\sigma \times \epsilon_\text{cuts} \times \epsilon_\text{rec}$. The shape 
of the difference we show in Fig. 2 for $\Delta\sigma_\text{top}/\sigma_\text{top} = 10\%$. The experimentally observed excess for the loose set 
of cuts has the same shape. In Fig. 2 the mass window $m_{jj} = 120 - 170$ GeV includes roughly 100 events which 
usually are attributed to the $WV$ contribution and any kind of new physics. If we conservatively neglect possible $WV$ 
contributions, according to Eq. (6) this corresponds to an $O(10\%)$ shift in the combined top rate. 
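
The arithmetic behind Eqs. (5) and (6) is a two-window linear system and can be reproduced in a few lines. The sketch below takes the four coefficients of the loose-cut analysis from the text; the function names and the tuple layout `(WV coefficient, top coefficient)` are hypothetical conventions introduced here.

```python
def compensating_wv_shift(peak_coeffs):
    """Relative WV shift per unit relative top shift that leaves the peak window unchanged."""
    wv, top = peak_coeffs
    return -top / wv


def net_high_mass_coefficient(peak_coeffs, high_coeffs):
    """Events gained in the high-mass window per unit relative top shift,
    once the WV rate has been shifted to compensate under the peak."""
    wv_high, top_high = high_coeffs
    return top_high + wv_high * compensating_wv_shift(peak_coeffs)


def required_top_shift(n_events, peak_coeffs, high_coeffs):
    """Relative shift of the combined top rate needed to supply n_events."""
    return n_events / net_high_mass_coefficient(peak_coeffs, high_coeffs)


# (WV coefficient, top coefficient) for the two mass windows, loose cuts.
LOOSE_PEAK = (926.0, 542.0)   # m_jj in [64, 96] GeV
LOOSE_HIGH = (88.0, 915.0)    # m_jj in [120, 170] GeV

if __name__ == "__main__":
    print("WV shift per unit top shift:", compensating_wv_shift(LOOSE_PEAK))
    print("net high-mass coefficient  :", net_high_mass_coefficient(LOOSE_PEAK, LOOSE_HIGH))
    print("top shift for 100 events   :", required_top_shift(100.0, LOOSE_PEAK, LOOSE_HIGH))
```

The same three functions apply unchanged to any other set of window coefficients, so the dependence of the required shift on the cuts can be read off directly.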

For the sum of top pairs and single top production with its different hard processes this shift could arise from a 
combination of experimental efficiencies and distributions, mostly of the many jets involved. For example, the number 
of events which we expect from the combined top sample is very sensitive to the $p_{T,j}$ requirements we apply. Moreover, 
from the CDF publications [1, 17] it is not clear how exactly the $tW$ single top channel has been computed [22]. Its 
size before cuts ranges around 1% of the top pair cross section [21], but after the cuts of Eq. (4) it could well account for 
a larger fraction of the shift in relative normalization. 

The compensating shift in the $WV$ rate is even smaller and clearly within the sizable uncertainties of up to $O(30\%)$ 
for the individual decay channels. In short, a very slight shift of the top sample normalization after cuts and efficiencies, 
compensated for by a shift of the $WV$ rate, completely explains the observed high-$m_{jj}$ anomaly. We should, however, 
remark that this loose cuts analysis is not a serious challenge to Standard Model explanations. It only serves as a 
way to illustrate and check our approach before we apply it to the more challenging dedicated analysis [2]. 

Figure 3: Left: excess in the $m_{jj}$ distribution above the $WV$ peak, as reported by CDF [2]. The electron and muon decay 
channels are added. Right: $m_{jj}$ templates for the $WV$ and top samples individually. The dark lines assume $E_{T,j} = 30$ GeV 
for the jet criteria, the lighter lines 40 GeV. We also show the difference between the two samples for a 40% change of $\sigma_\text{top}$ and 
a corresponding shift in $\sigma_{WV}$, as described in the text. 



Hard cuts 



After observing the $m_{jj}$ anomaly in their $WV$ analysis CDF performed a dedicated analysis of this shape. To focus 
on the high-mass regime and to remove backgrounds they change some of the cuts shown in Eq. (4) to 

$$\text{(1) exactly two jets with } E_{T,j} > 30~\text{GeV} \qquad \text{(2) additional dilepton veto} . \tag{7}$$

As we will see later, the veto on three or more jets makes a big difference, both in the extraction of the signal and in the 
uncertainties on the background estimates. Unlike for the loose cuts, this experimental analysis shows a distinct excess 
in Fig. 3. The additional requirements affect the relative composition of all channels in the $m_{jj} = [28, 200]$ GeV 
window [17]. For example, $WV$ production now contributes 6.4% of all events, compared to 3.4% for the loose cuts. 
The top contribution very slightly decreases from 6.0% to 5.8%. For the two mass windows we now find 

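$$
\begin{aligned}
\Delta N_{[64,96]} &= 475\,\frac{\Delta\sigma_{WV}}{\sigma_{WV}} + 137\,\frac{\Delta\sigma_\text{top}}{\sigma_\text{top}} \\
\Delta N_{[120,170]} &= 45\,\frac{\Delta\sigma_{WV}}{\sigma_{WV}} + 244\,\frac{\Delta\sigma_\text{top}}{\sigma_\text{top}} .
\end{aligned} \tag{8}
$$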

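Again, we use $\sigma$ for the cross section after cuts and efficiencies, i.e. $\sigma \times \epsilon_\text{cuts} \times \epsilon_\text{rec}$. The relative normalization is 
fixed by the $WV$ peak region, giving us $\Delta\sigma_{WV}/\sigma_{WV} = -0.29\,\Delta\sigma_\text{top}/\sigma_\text{top}$ and 

$$\Delta N_{[120,170]} = 231\,\frac{\Delta\sigma_\text{top}}{\sigma_\text{top}} . \tag{9}$$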

Naively, we see around 230 events in the high mass region $m_{jj} = 120 - 170$ GeV. From this number we have to subtract 
the number of events which are described by the $WV$ channel, including systematic uncertainties. This leaves us with 
around 150 events which can for example be explained by a Gaussian new physics contribution. 

However, this number of events changes after a more careful study of the $m_{jj}$ distribution. First, in the $m_{jj} = 
170 - 250$ GeV range we see a significant tail, consistently 10 to 20 events above the $WV$ expectations. They might be 
explained by some kind of continuous background which would also contribute to the $m_{jj} = 170 - 250$ GeV window. 
Secondly, under the $WV$ peak of Fig. 3 there are clearly events missing, of the order of 50. Our simple compensation 
of the $WV$ and top channels cannot account for them because they are missing on the left side of the peak. Standard 
Model channels which rapidly drop towards larger $m_{jj}$ values should help to explain them. This way we would slightly 
decrease the number of events missing in the higher mass regime. 

Nevertheless, explaining an excess of more than 100 events in the $m_{jj} = 120 - 170$ GeV window requires a sizable shift in 
the normalization of the top sample. Eq. (9) implies $\Delta\sigma_\text{top} \approx 0.43\,\sigma_\text{top}$ and a compensating shift in the $WV$ rate of 
the order of $O(10\%)$. 
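
The chain of numbers for the hard cuts can be written out explicitly. The following worked steps use only the coefficients of Eq. (8) and the figure of roughly 100 events quoted above; the rounding is ours.

```latex
\begin{align*}
\Delta N_{[64,96]} = 0
  \;&\Rightarrow\;
  \frac{\Delta\sigma_{WV}}{\sigma_{WV}}
  = -\frac{137}{475}\,\frac{\Delta\sigma_\text{top}}{\sigma_\text{top}}
  \approx -0.29\,\frac{\Delta\sigma_\text{top}}{\sigma_\text{top}} \\
\Delta N_{[120,170]}
  &= \left(244 - 45\cdot\frac{137}{475}\right)\frac{\Delta\sigma_\text{top}}{\sigma_\text{top}}
  \approx 231\,\frac{\Delta\sigma_\text{top}}{\sigma_\text{top}} \\
\Delta N_{[120,170]} \approx 100
  \;&\Rightarrow\;
  \frac{\Delta\sigma_\text{top}}{\sigma_\text{top}} \approx \frac{100}{231} \approx 0.43 ,
  \qquad
  \frac{\Delta\sigma_{WV}}{\sigma_{WV}} \approx -0.29 \times 0.43 \approx -0.12
\end{align*}
```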

Of course, this does not mean a 43% shift in the theoretically predicted total cross section for top production. 
Almost a third of the combined top sample is single top production. For the jet veto survival probability the CDF 
analysis includes neither a reliable experimental [23] nor a reliable theoretical estimate [21]. Thus, we expect a very 
large error bar on the single top rate after cuts and efficiencies. Top pair production might not be quite as critical 
because the parton shower approximation should describe jets properly [21, 24]. 

All efficiencies depend very strongly on the detailed simulation of the QCD jet activity and the $p_T$ requirements. 
For example, if we increase the detection and veto threshold from 30 GeV to 40 GeV the overall efficiency increases 
quite dramatically for the top sample, as shown in Fig. 3 and expected from Fig. 1. In addition, it changes the shape 
of the top template. A reduced efficiency for $WV$ events means that instead of the relation used in Eq. (9) we find 
$\Delta\sigma_{WV}/\sigma_{WV} = -0.68\,\Delta\sigma_\text{top}/\sigma_\text{top}$, which makes it easier to explain the second peak. This indicates large theory and systematic 
uncertainties associated with the jet veto. The fact that it is challenging to describe the top sample after jet related 
cuts is illustrated by the poor separation of different single top channels in the corresponding CDF analysis [23]. We 
check that the corresponding uncertainty for loose cuts without a jet veto is very well under control. 

Taking our 40 GeV templates at face value, the required change in the combined top rate drops significantly, entirely 
due to a strong dependence on the poorly understood jet veto survival probability. In essence, subtracting combined 
top backgrounds after a jet veto combines too many caveats, which have to be taken into account as correspondingly 
large systematic and theoretical uncertainties. 

Summary 

We have shown that the apparent excess in $W$+jets events can be explained by Standard Model top backgrounds. 
Hadronically decaying top quarks generically produce two peaks in the $m_{jj}$ distribution. To explain the CDF measurements 
we have to enhance the normalization of the combined top pair and single top templates after cuts and 
detector efficiencies. Given the inherent difficulties in quantifying jet veto survival probabilities, such a shift in the 
10% (for the WW analysis without a jet veto) or the 40% (for the high-mass analysis with a jet veto) range appears 
reasonable and expected from QCD considerations. To maintain the measured event numbers under the WW peak 
we compensate for this shift in the top template with another shift in the WW normalization. The latter does not 
exceed 10% and is well within the uncertainties indicated by the different CDF results for the individual electron and 
muon channels. 

Note added: after this work was finished, another paper with very similar conclusions appeared [25]. 